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Abstract. We characterise the evolution of a dynamical system by combining two 
well-known complex systems’ tools, namely, symbolic ordinal analysis and networks. 
From the ordinal representation of a time-series we construct a network in which every 
node weights represents the probability of an ordinal patterns (OPs) to appear in 
the symbolic sequence and each edges weight represents the probability of transitions 
between two consecutive OPs. Several network-based diagnostics are then proposed 
to characterize the dynamics of different systems: logistic, tent and circle maps. We 
show that these diagnostics are able to capture changes produced in the dynamics 
as a control parameter is varied. We also apply our new measures to empirical data 
from semiconductor lasers and show that they are able to anticipate the polarization 
switchings, thus providing early warning signals of abrupt transitions. 

PACS numbers: 05.45.Tp; 89.75.-k; 89.70.Cf; 89.75.Hc 


Quantifying sudden changes in dynamical systems using symbolic networks 2 

1. Introduction 

Symbolic time-series analysis is a powerful technique able to extract hidden features 
such as the presence of frequent recurrent patterns [D [21 a n E], or the existence of 
missing/forbidden patterns [HllTlIH] from stochastic, high-dimensional signals. Symbolic 
analysis has also shown to be useful for classifying different types of signals pcniiiiiin] 
and, in bivariate analysis, for inferring the direction of information flow [mill]. The 
symbolic approach involves the transformation of a time-series, x{t), into a sequence 
of symbols, s{t), by using an appropriated codihcation rule. A signihcant advantage 
of symbolization is its reduced sensitivity to observational noise. Complexity measures 
have been proposed to characterize the resulting symbolic sequence, a very popular one 
being the permutation entropy dsi ESI im Ell Esiisniiii], computed with an ordinal 
symbolization rule, by comparing neighboring values in the time-series. 

Complex networks have also been successfully employed for time-series analysis 
[22l |23l EU ESI EEl EZ]- Correlation graphs |2Sl EH], recurrence graphs [301 EH ES] and 
visibility graphs [33l jSH [35l |36l [37] have been shown to provide relevant information, 
for example, of early-warning indicators of qualitative changes and abrupt transitions. 

Here we characterise the evolution of a dynamical system by combining symbolic 
ordinal analysis and network representation, in a hybrid methodology. Specihcally, 
a time series x(t) (raw data) is transformed in a sequence, s(t), of symbols (ordinal 
patterns, OPs) that is used to construct a directed and weighted graph where the different 
symbols that appear in s{t) constitute the network nodes. Each of these nodes has 
an associated probability of occurrence. The transitions between consecutive symbols 
constitute the network links: any pair of nodes, i and j, are connected by a directed 
link if the sequence of symbols {i,j) occurs in the symbolic sequence, and the weight of 
the directed link is the probability of j occurring after i (referred to as the transition 
probability i ^ j, TP). 

Our approach for dehning the links as the transition probability between symbols 
was originally proposed by Nicolis et al. [22] and the use of ordinal analysis was proposed 
by Sun et al. [27] ; however, in [27] the weights of the links were not dehned in terms 
of TPs (as in [22]), but rather, they were dehned as the number of times the pair of 
consecutive symbols {i,j) occurred in the time series. A major advantage of the use of 
TPs is that they are normalized in each node, and thus, allow for computing an entropy 
for each node, in the following referred to as node entropy. The node entropies can then 
be used to identify changes in the network, which rehect changes in the time-series when 
the control parameter is varied. 

Exploiting this network representation we propose several novel measures to 
characterize a time series: 

• the nodes ’ entropy, which is the mean entropy of the distribution of the weights of 
the outgoing links of a node (in other words, is the node entropy, averaged over all 
the nodes of the network.); 

• the links ’ entropy, which is the entropy of the distribution of links’ weights; 
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• the asymmetry coefficient, which is computed from the difference of the weights of 

the links i ^ j and j ^ i, averaged over all the links of the network 

Note that the nodes’ and the links’ entropies are computed according to the classical 
Shannon entropy dehnition. 

By analyzing numerical data generated from the logistic map, the tent map, and the 
circle map, and also experimental data, recorded from the intensity of semiconductor 
lasers under different operation conditions, we show that these measures are able to 
capture gradual and abrupt changes in time-series. A comparison with the permutation 
entropy (PE) reveals that the entropies dehned from the symbolic network vary over 
a wider range of values and outperform the permutation entropy in providing early 
warning signals of sudden changes in the time-series. 

This paper is organized as follows. In Sections 2 and 3 the method of network 
construction and the network-based measures proposed for characterizing time-series 
are described. In Section 4 results are presented and, hnally, we provide the discussion 
and some conclusions in Section 5. 

2. Network construction 

We transform a time series x{t) into a sequence of symbols s{t) by using the ordinal 
pattern (OP) representation with symbols of length D. In this case, symbols are dehned 
by considering groups of D consecutive values in the time series na. There exist D\ 
different OPs of length D. For example, for D = 2 there are two OPs: x(t) < x(t -|- 1) 
gives symbol ‘01’ and x{t) > x{t + 1) gives symbol ‘10’; for D = 3 there are six possible 
OPs: x{t) < x{t -I- 1) < x{t + 2) gives ‘012’, x{t -f 2) < x{t 4-1) < x{t) gives ‘210’, etc. 

In general, the ordinal representation of x{t) gives a sequence s{t) of OPs with M 
different symbols because depending on the system’s dynamics not all possible symbols 
will be present in the symbolic sequence, s{t), either because the dynamics does not 
allow for some of the (the so-called “forbidden patterns”), or because the time series 
x{t) is of hnite length and simply they do not appear (“missing patterns”) [6l [TJ [8]. 
Thus, the number of nodes in the network corresponds only to those symbols appearing 
in the symbolic sequence and is M < D\. 

The weight of a node i is the relative number of times the symbol i occurs in the 
sequence s{t)\ 

Pi = j^n[s{t) = i], (1) 

^ t=i 

where n is a count of the number of occurrences and L is the length of the sequence 
s{t). The weights are normalized, the permutation entropy, Sp (PE), 

is the entropy of the distribution of node weights [T5] . 

M 

Sp = Pi log Pi. 

i=l 


( 2 ) 
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We note that 0 < Sp < logM, with Sp = 0 if p* = 1 for i = i* and Pi = 0 ^i ^ i* 
(only one symbol, i*, appears in the symbolic sequence and thus the network has only 
one node); Sp = logM if pi = 1/M Vi (all the symbols appear in the symbolic sequence 
with the same probability and thus all the nodes have the same weight). 

The weight of a link i —)■ j, Wij, is the relative number of times, in the sequence 
s{t), the symbol i is followed by the symbol j: 


L-l 

^n[s(t) = i,s{t + l) =j] 

-• ( 3 ) 

^n[s(f) = i] 

t=i 


With this definition the link weights are normalized in each node, i.e., ~ 

We note that this definition allows for the presence of self-loops. 

It is illustrative to describe the type of networks that are generated by simple 
dynamics. A periodic time-series will give a regular symbolic sequence, whose periodicity 
will depend on the length of the ordinal pattern, D. The resulting symbolic network will 
then depend on D. To focus the ideas, let’s consider the time series shown in Fig. [I](a), 
with period 4, generated from the logistic map. With D = 3 we obtain the following 
sequence of symbols 


201 , 021 , 102 , 120 , 201 , 021 , 102 , 


(4) 


Because the TPs are computed from ordinal patterns that are formed by non- 
superposed values [i.e., for D = 3, {x(t), x{t + 1), x(t + 2)} define one pattern and 
{x{t -|- 3), x{t + 4), x{t -l- 5)} define the next one], the network obtained has 4 nodes 
connected as follows: 


201 ^ 120, 021 ^ 201, 102 ^ 021, 120 ^ 102. (5) 

If we use D = A OPs, then the sequence of symbols is 

3021, 0213, 2130, 1302, 3021, 0213, 2130, ... (6) 

and the resulting network has 4 nodes with self-loops, 

3021 ^ 3021, 0213 ^ 0213, 2130 ^ 2130, 1302 ^ 1302. (7) 

It is important to note that, in the ordinal encoding scheme, depending on the value of 
D, an irregular time series can have associated a regular symbolic sequence, and thus, a 
regular network. To £x the ideas, let’s consider the time series shown in Fig. [I](b), also 
generated from the logistic map. With values of D equal to 2, 3 or 4, this time-series 
gives the same symbolic sequence as the period-4 time-series shown in panel (a). This 
feature is a consequence of the ordinal codification rule. Other codification choices can 
result in two different symbolic sequences. 

On the other hand, if the time-series is fully random, the symbolic sequence will also 
be fully random, and then the network will have a regular, all-to-all topology, regardless 
of the value of D (assuming the time-series is sufficiently long). 
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Figure 1. Time series generated from the logistic map with (a) r = 3.5 and (b) 
r = 3.58. In panel (a) the triangles and the squares indicate two ordinal patterns 
formed by two sets of D = 3 consecutive and non-overlapping values, which give the 
transition 201 —>■ 120. 


3. Network-based diagnostics for time-series analysis 

The symbolic network allows defining several novel measures: 

1) Since compute an entropy at each node, 

M 

Si = - ^ Wij \ogWij, (8) 

i=i 

and then characterize the network heterogeneity in terms of the distribution of the Sj 
values. 

We note that, on one hand, s, = 0 if and only if node i has only one outgoing link, 
i*\ in this case, Wij* = 1 and Wij = 0, V j 7 ^ j*, and the symbol j* occurring after 
symbol i is always the same. In this case, in the symbolic sequence, the symbol that 
follows i is fully predictable. On the other hand, if all the outgoing links of node i have 
the same weight, then Wij = 1/ki and s* = log ki (here /c* is the out-degree of node 
h ki = where Aij are the components of the adjacency matrix, Aij = 1 if 

Wij 7 ^ 0, otherwise A^j = 0). The maximum Sj value is logM and corresponds to a node 
that is connected to all the other nodes -including itself- with links that have uniform 
weights. In this case, in the symbolic sequence any symbol can follow i with the same 
probability. 

We propose the use of the first moment of the distribution of s* values, the network- 
averaged node entropy, 

1 ^ 

i=l 

as a novel measure for the analysis of a time-series. We note that 0 < < logM. If 

Sn = 0, then Sj = 0 Vi, and all the nodes have only one outgoing link; therefore, the 
symbol j that occurs after symbol i is fully predictable, Vi. On the contrary, the largest 
Sn value, logM, occurs when Sj = logM Vi. In this case the symbolic sequence is fully 
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random and after any symbol, i, any symbol in the sequence can follow with the same 
probability 1/M. 

We note that the range of variation of the node entropies, s*, and of their average, 
Sn, is the same as that of the permutation entropy, Sp. 

2) From the probability distribution function (pdf) of the weights of all the links, 
p{w), we define the link’s entropy as: 

Si = — p{w) \ogp{w)dw. (10) 


We have estimated p{w) from an histogram of the weights of the existing links 
{wij 7 ^ 0). Large values of si indicate a heterogeneous distribution of link weights, while 
small values of si indicate a narrow distribution of link weights. The delta distribution, 
Wij = w, \/i,j that gives s; = 0 includes two limits, one in which the symbolic sequence 
is fully predictable (each node has only one outgoing link: Vi, Wij* = 1 and Wij = 0 if 
j 7 ^ j *) and the other in which the symbolic sequence is fully unpredictable (the nodes 
are all-to-all connected with uniform weights: Wij = 1/M, Vi, j). Thus, s/ quantihes the 
complexity of the symbolic sequence, as s; = 0 corresponds to both, perfectly regular 
and fully random sequences. 

3) To quantify the asymmetry in the direction of the network links, we introduce 
the asymmetry coefficient, which is defined as: 


I'^q' '^ji\ 

i 

i ji^i 


( 11 ) 


Oc = 0 in a fully symmetric network {wij = Wji, Vi,j) and Oc = 1 in a fully directed 
network (either Wij = 0 or Wji = 0 'ii,f\ This asymmetry coefficient is closely related 
to the concept of reciprocity in graphs [35] . 

We note that for computing these measures, only the nodes and the links that 
are present in the network are taken into account. While absent nodes and links, 
corresponding to missing or forbidden symbols and transitions, could have been taken 
into account with zero weights, we preferred to use this more generic approach because, 
depending on the symbolic transformation used, the complete set of symbols can be 
unknown or even inhnite. 


4. Results 

J^.l. Analysis of synthetic data 

We start by presenting the results of the analysis of simulated time-series for the logistic 
map, the tent map and the stochastic circle map. The equations and parameters are 

• logistic map: Xi+i = rxi{l — Xi); 
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• tent map: Xj+i = rxi if Xi < 0.5, Xj+i = r(l — Xi) if x, > 0.5; 

• stochastic circle map: 0j+i = 0* + p + fc(sin27r0j)/(27r) + fif. Here ^ is a Gaussian 
white noise with unit standard deviation, k is the map parameter, f3 is the noise 
strength and p = —0.23 is kept fixed. We analyze time series of Xi = cfi — 0i_i. 

The analysis is performed with time-series of length L = 6000 and patterns of length 
D = A. The results are robust as long as L is much larger than the number of possible 
links. The value of D is chosen because it allows constructing symbolic networks with 
24 possible nodes and 576 possible links; in contrast, D = 5 gives networks with 120 
possible nodes and 14400 possible links, and therefore, computing the links’ weights 
with robust statistics requires long time-series. We discuss below the influence of D and 
L, when we analyze empirical data. 

Regarding the logistic and tent maps, they have similar bifurcation sequences 
(except for the periodic windows in the logistic map) and thus, assuming similar 
bifurcation sequences give rise to similar dynamical behaviors, we expect to obtain 
similar symbolic networks, which will depend on the control parameter. Indeed, this 
is observed in Fig. [2] that displays, for the logistic map (left column) and for the tent 
map (right column), the bifurcation diagrams (top row), the permutation entropy, Sp, 
and the average node entropy, s„, (middle row) and the links’ entropy, si, and the 
asymmetry coefficient, Oc (bottom row). For both maps Sp and increase in a similar 
way with the map parameter (except in periodic windows). One can note that varies 
over a wider range of values in comparison to Sp. We note that this difference in the 
actual range of variation of Sn and Sp can not simply be normalized over because, as 
discussed in Sec. 3, both, and Sp vary within the same range of values. Also, 
displays more abrupt variations, which can be understood in the following terms: as 
the map parameter increases, the dynamics becomes increasingly chaotic and new OPs 
appear in the symbolic sequence, which result in new nodes and links in the symbolic 
network; however, the frequency of occurrence of the new OPs is initially small, and 
their appearance does not produce abrupt variations in the permutation entropy. But 
the new nodes will not necessarily have small entropies; therefore, they can induce 
abrupt variations in the average node entropy. 

Both, Sp and vary with the map parameter in a qualitatively similar way as 
the Lyapunov exponent (LE). For the logistic map it has previously been shown that 
for chaotic parameters and low D values, Sp behaves as the LE [15]. In addition, for 
piecewise monotone interval maps, in the limit oi D ^ oo, Sp tends to the topological 
entropy [I6|. In the visibility graph, a direct relationship between the network entropy 
and the LE can be established by using the Pesin identity [36]. Therefore, these 
results suggest that the network entropy can indeed be expected to capture nontrivial 
properties of the dynamics; however, determining a direct correspondence between 
different network properties and different kinds of dynamics is not straightforward and 
is left for future work. 
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Figure [3] displays the results of the analysis of simulations of the stochastic circle 
map, when varying the parameter that represents the forcing amplitude (left column) or 
the noise strength (right column). Varying k allows to study the transition to locking. 
For very small values of k the stochastic term dominates and the network contains all 
possible nodes and links, thus, Sp and Sn are maximum (~ log4! = 3.18). As k increases 
the dynamics becomes more deterministic, the number of nodes and links gradually 
decrease (not shown) and Sp and also decrease. At fc ~ 1.5 the transition to locking 
occurs [see the bifurcation diagram in Fig. [31(a)] and the presence of noise results in a 
symbolic sequence that is again fully random, thus, Sp and again reach the maximum 
value. If k is further increased, again an increase in the “order” occurs and Sp and 
again decrease. The right column in Fig. [3] reveals that Sp and capture the monotonic 
increase of “disorder” in the symbolic sequence as the noise strength, /3, increases. 





3.6 3.7 3.8 3.9 

Map parameter 


(f) 




1.2 1.4 1.6 1.8 

Map parameter 


Figure 2. Analysis of the logistic map (left column) and tent map (right column), 
(a), (b) Probability distribution function (pdf) of the variable x (in color scale) vs. the 
map parameter, (c), (d) the average node entropy, (red line), and the permutation 
entropy, Sp (black line). Note that the range of variation of is more than twice that 
of Sp. (e), (f) the links’ entropy, s„ (black line), and the asymmetry coefficient, Uc (red 
line). The analysis was performed with D = 4 and L = 6000. 

Other network measures are consistent with these observations. As seen in Figs.[2](e) 
and[2](f), for the logistic and for the tent maps, when the dynamics is weakly chaotic 
(r 3.6 for the logistic map; r 1.2 for the tent map), the symbolic network has a low 
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link entropy and is highly asymmetric (the asymmetry coefficient is close to 1); as the 
parameter increases the network grows and the link’s entropy increases, accompanied 
by large variations of the asymmetry coefficient; for large parameter values (strong 
chaos) both, the asymmetry coefficient and the links’ entropy decrease, suggesting that 
the network becomes more homogeneous. These results are consistent with a previous 
analysis of the complexity of the logistic map, where the peak values of the various 
complexity measures analyzed occurred at intermediate values of the map parameter 
[39]. For the stochastic circle map, si and Oc [Figs. [3](e) and[3](f)] reveal that the network 
is homogeneous (large link entropy accompanied by small asymmetry coefficient) for 
/c ~ 0, at the locking transition, and for strong noise. 







Figure 3. As Fig. [2] but for the stochastic circle map when the map parameter, k, is 
varied (left column) and when the noise strength, /?, is varied (right column). 


4-2. Analysis of empirical data 

Let us next present the results of the analysis of two empirical data sets. The hrst 
data set was recorded from the output of a semiconductor laser at various values of the 
laser bias current ranging from 5.5 mA to 6.2 mA. The type of laser is a vertical-cavity 
surface-emitting laser (VCSEL) that can emit two orthogonal polarizations and, when 
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varying the bias current, there is an abrupt polarization switch (PS) at about 6 niA: the 
dominant polarization turns off and the orthogonal one turns on, as shown in Fig. H] (a). 
The data consist of 71 time-series of the intensity of one polarization mode recorded at 
hxed values of the bias current, below and above the polarization-switching point. Each 
time series contains more than 10® data points. In order to identify clear trends, each 
time-series was divided in several sections, each containing L data points (L varying in 
the range 10^ — 10® as discussed below), and the network indicators were computed by 
averaging the values in each section. 

Since the empirical data is very noisy, see Figs. |l](b) and (c), all possible pairs 
of symbols {i,j) occur in the symbolic sequence and the network is a regular, all-to- 
all graph. Nevertheless, we will show that the network measures adequately capture 
dynamical changes in the time-series. 
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Figure 4. (a) Pdf of the intensity of the linear polarization mode that turns on vs. 
the laser bias current. Intensity time-traces just before (b) and after (c) the switch. 


Figure |5] presents a comparison of the the network-based measures, with the mean 
value, (/), and the standard deviation, a, of the intensity fluctuations: (/) and a vary 
linearly, and thus, they don’t provide an early warning for the sudden polarization 
switch. It can also be seen that both, Sp and s„, change the slope of initial linear trend 
before the PS, but the Sn trend decreases faster, providing a better indicator of the PS. 

While the evidence of early warning in terms of a single network measure is not 
strong, taken together these results provide clear early warnings of abrupt switching. 
This suggests that the switching involves deterministic light-matter interactions that, 
despite of the highly stochastic character of the signals analyzed, the network measures 
are able to capture. In VCSELs the two linear polarizations are strongly anti-correlated 
and the polarization switching can be in part due to thermal effects (Joule heating) 
through a shift of the gain maximum [iQ]. Also, small cavity anisotropies can play 
a role, as the VCSEL circular transverse geometry provides no polarization selection 
mechanism. In addition, material birefringence, saturable dispersion and a microscopic 
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spin-flip relaxation mechanism can explain the PS [H]. Because various sources 
of noise are ubiquitous in laser systems (spontaneous emission noise, thermal and 
electrical noise), it is not possible to investigate experimentally the relative impact of 
stochastic and deterministic mechanisms that trigger polarization switchings. Various 
models suggest that several laser parameters can modify the linear stability of the two 
polarization modes, while small variations (deterministic or stochastic) are responsible 
for triggering the switch. 

Figures [6] and [7] analyze the influence of the length of the ordinal pattern, D, and 
the number of data points, L. In Fig. [6] we note that the above observations are robust 
with respect to D, while in Fig. [71 we note that if L is too short, the network measures 
fail to provide an early warning. 

To conhrm these observations we now consider a second empirical data set, in 
which the intensity of one polarization mode was recorded while the control parameter, 
the laser pump current, was linearly scanned across the switching point. The data 
was recorded from the output of a different VCSEL, which was subject to polarized 
optical feedback. In this VCSEL the polarization switching (shown in Fig. |8]) occurred 
only if the feedback was strong enough; without feedback no polarization switching was 
observed in this device. This setup has therefore the advantage that, by hnely tuning 
the feedback strength, we could record two sets of time-series: one set in which a PS 
occurs, and, by decreasing slightly the feedback strength, another set in which no PS 
occurs. 

We recorded 1000 independent realizations (each time-series has 20000 data points), 
and in each time-series the control parameter, the laser current, varied from a value 
below the PS to a value above the PS (see Fig. [H]). For comparison, a second data 
set (also of 1000 time-series with 20000 data points) was recorded with slightly weaker 
feedback, such that no switchings were observed. 

We divided each time-series in non-overlapping sections and computed the network 
measures in each section. As we are interested in anticipating the switching in real time, 
we want to consider sections as short as possible, but on the other hand, they can not 
be too short as we need to compute the weights of the links with good statistics. We 
found that using sections of L = 500 data points each, and ordinal patterns of D = 3, 
provided a good compromise, because we have 6 nodes, 36 links and their probabilities 
can be computed, in each section, with sufficient statistics. 

The results are presented in Fig. [9l which displays de distribution of Sp, Sn and Oc 
values computed over the 1000 independent realizations: the left column corresponds to 
the data set in which PS occurs, while in the right column, no PS occurs. We observe 
that Sn varies over a wider range of values with respect to Sp and, when approaching the 
switching, both and Sp increase reaching similar values. The asymmetry coefficient 
decreases only slightly when approaching the PS and thus is not a good indicator of the 
switching ahead (this is probably because of the short section length, see the influence 
of L in the analysis of the hrst empirical data set. Fig. [7D. 
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Figure 5. Early indicators of abrupt polarization switching (PS). The data analyzed 
is as in Fig. 01 (a) the mean value and (b) the standard deviation of the intensity 
fluctuations; (c) the permutation entropy, (d) the average node entropy, (e) the links 
entropy and (f) the asymmetry coefficient. Sp and are normalized to the maximum 
value, log I?!. The analysis was performed with D = S. The error bars indicates 
the standard deviation of the distribution of values computed over 100 sections with 
L = 1000 data points each (see text for details). It can be seen that Sp, and Oc 
are good indicators of the abrupt switching, as they change the linear slope before the 
PS; in contrast (/), cr/ and si are not good indicators as they either vary linearly or 
fluctuate with constant amplitude. Sp varies nonlinearly before the PS but in a smaller 
range of values as compared with s„. 


5. Conclusions and discussion 

We have used a hybrid methodology to characterize the evolution of a dynamical system 
that combines two mathematically grounded tools for time-series analysis: symbolic 
ordinal analysis and network representation. Specihcally, time series were transformed 
in sequences of ordinal patterns that were used to construct directed and weighted 
graphs. Exploiting this graph representation we propose several novel diagnostics to 
characterize time series: the averaged node entropy, the entropy of the distribution of 
links’ weights, and the asymmetry coefficient. The analysis of numerical data (generated 
from the logistic, tent and circle maps) and experimental data (recorded from the output 
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Figure 6. Influence of the length D of the ordinal pattern. The data analyzed is as 
in Fig. m The left column presents results for D = 2, the central column, for D = 3 
and the right column, for D = A. The top row displays the permutation entropy, the 
central row, the average node entropy, and the bottom row, the asymmetry coefficient. 
For easy comparison Sp and are normalized to the maximum value, logZI!. The 
error bars indicates the standard deviation of the distribution of values computed over 
100 sections with L = 1000 data points each (see text for details). The behavior of the 
network measures is robust for D = 3 and D = A and even for D = 2, Sn anticipates 
the polarization switching. 


of semiconductor lasers under different external perturbations) demonstrated that these 
network-based measures are suitable for characterizing time-series, adequately capturing 
the effects of parameter changes that result in subtle or sudden transitions. 

The links entropy is an interesting quantity for measuring the complexity of the 
symbolic sequence associated to a time-series, because it is equal to 0 when the weights 
of the links are delta-distributed, which occurs when the symbolic sequence is either 
perfectly regular or fully stochastic. In agreement with a previous study that found 
maximum complexity of the logistic map at intermediate values of the map parameter 
[39] . here we found that the links’ entropy is maximum at intermediate values of r. 

The analysis of the empirical data confirmed that network-based measures provide 
early warning signals of abrupt transitions, despite of the highly stochastic character 
of the signals analyzed. These network-based networks can thus complement other 
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Figure 7. As in Fig. [6] but now we analyze the effect of the length of the time-series: 
in the left column, L = 500; in the central column, L = 1000 and in the right column, 
L = 5000. In each panel D = 3. We note that, as the length of the data set increases, 
the size of the error bars diminishes. 



Figure 8. Second empirical data set recorded from a semiconductor laser with 
orthogonal optical feedback: pdf of the intensity (in color code) of the linear 
polarization mode that turns off vs. time. The pdf is computed by averaging over 
1000 time-series. 
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PS data NO PS data 








Figure 9. Comparison between 1000 independent realizations in which polarization 
switchings occur (left column) and 1000 independent realizations in which no switching 
occurs (right column, see text for details). Top row: the permutation entropy; middle 
row: the average node entropy; bottom row: the asymmetry coefficient, Oc- The 
analysis is performed with D = 3, and each time series is divided in 40 non-overlapping 
sections oi L = 500 data points each. Sp and are normalized to the maximum value, 
log I?!. While the dispersion of values is quite large (due to the short length of each 
section), in the left column there is a clear trend in the distributions of Sp and s„ values 
when approaching the switching point. In comparison the variation of Uc is small and 
thus, Uc does not constitute a good indicator of the PS. We observe that varies over 
a wider range of values with respect to Sp and, when approaching the switching, both 
s„ and Sp increase reaching similar values. 


indicators, such as the standard deviation or the permutation entropy. The methodology 
proposed here can be a valuable tool for the analysis of a wide range of systems 
where critical transitions might occur, including population extinctions, desertihcation, 
wetland degradation and epileptic seizures, to name a few. 

Our approach could allow identifying changes in the symbolic dynamics of these 
systems, for example, the appearance of new symbols or the appearance of new 
transitions, which result in variations of the network measures. Bifurcations from a 
chaotic attractor to another chaotic attractor could be identihed if the two attractors 
have associated symbolic dynamics which have different statistical distribution of 
symbols and transition probabilities. On the contrary, if a bifurcation does not result 
in changes in the symbolic dynamics of the system, the network measures proposed 
here won’t be able to capture the bifurcation (see, e.g., the two time-series displayed in 
Fig. [H which, with D being 2, 3 or 4, ordinal patterns are encoded in the same symbolic 
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sequence). 

Another important drawback is that both, periodic and irregular time series could 
eventually be mapped to the same class of regular networks. Note that this methodology 
is not intended to classify between different time series, but to detect changes in the 
time series as a function of the tuning parameter. Thus, this methodology should be 
only used to complement other conventional tools of time-series analysis if the goal is a 
classihcation. In spite of these drawbacks, our approach provides additional predictive 
power for time-series analysis, as identifying the nodes which have one (or a reduced 
number) of outgoing links will allow identifying “predictable symbols” in the symbolic 
sequence, those for which we know which is the next most probable symbol in the 
symbolic sequence. If ordinal analysis is used to construct the symbolic network, then 
the information gained about the next most probable symbol will not allow inferring 
the variations of values in the time-series, but it will allow inferring the shape of the 
future oscillation in the time-series. In this sense, the symbolic network-based analysis 
proposed here will provide complementary information to that gained by using other 
well-established tools for time-series analysis. 
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